Add set data test cases #61

stevenhua0320 · 2024-08-12T03:23:57Z

Note that I still make the case x.size > 0 and res ==0 will give a message because as before Luke said that this case would make trivial clustering, which would make every data point as its own cluster, which is not desired clustering that we want.

…or of the test pass.

sbillinge

nice job! Please see comments.

sbillinge · 2024-08-12T11:23:47Z

src/diffpy/srmise/tests/test_dataclusters.py

+
+    # In the set data test, we test for these cases.
+    # (1) x and y are non-empty array values, and res is positive (the most generic case)
+    # (2) x and y are non-empty array values, and res is 0 (will produce a msg that makes trivial clustering)


I would not raise errors or warnings for this. It is just the identity (gives back the input unchanged) which is not that useful but not anything invalide. Please make an issue to address this in the docs, but not in the code.

sbillinge · 2024-08-12T11:30:44Z

src/diffpy/srmise/tests/test_dataclusters.py

+)
+def test_set_data(inputs, expected):
+    actual = DataClusters(x=inputs["input_x"], y=inputs["input_y"], res=inputs["input_res"])
+    assert actual == expected


just a note, the expected we would like to explicitly set all its attributes. this will also implicitly test other parts of the code that generate those attributes.

sbillinge · 2024-08-12T11:31:58Z

src/diffpy/srmise/tests/test_dataclusters.py

+        (
+            # case (4)
+            {
+                "input_x": np.array([]),


as I mentioned before, please use "x" here, not "input_x" to make the code more readable. Also, don't make the x-array empty, but just make them have different lengths.

sbillinge · 2024-08-12T11:34:49Z

src/diffpy/srmise/tests/test_dataclusters.py

+            "Sequences x and y must have the same length.",
+        ),
+        (
+            # case (5)


not needed. Please delete.

sbillinge · 2024-08-12T11:35:19Z

src/diffpy/srmise/tests/test_dataclusters.py

+                "input_y": np.array([3]),
+                "input_res": -1,
+            },
+            "Resolution res must be non-negative.",


please expand a bit. Will the user know what "resolution" means?

sbillinge · 2024-08-12T11:35:47Z

src/diffpy/srmise/tests/test_dataclusters.py

+            "Resolution res must be non-negative.",
+        ),
+        (
+            # case (2)


not needed, please remove. This will be handled in docs not in code.

sbillinge · 2024-08-12T11:36:27Z

src/diffpy/srmise/tests/test_dataclusters.py

+        ),
+    ],
+)
+def test_set_data_order_bad(inputs, msg):


this is a good test, nice job.

sbillinge · 2024-08-12T11:37:42Z

src/diffpy/srmise/tests/test_dataclusters.py

+            DataClusters(np.array([1, 2, 3]), np.array([3, 2, 1]), 4),
+        ),
+        (
+            # case (6)


this is not needed (because we are not catching res 0 any more)

sbillinge · 2024-08-12T11:38:24Z

src/diffpy/srmise/tests/test_dataclusters.py

+    # (4, 5) One of x and y is empty array, and res is positive
+    # (produce ValueError & msg "Sequences x and y must have the same length.", something like that)
+    # (6) Both x and y are empty array, and res is zero.
+


please remove these blank lines so the tests are as compact as possible.

sbillinge · 2024-08-12T11:38:26Z

src/diffpy/srmise/tests/test_dataclusters.py

+    # (2) x and y are non-empty array values, and res is 0 (will produce a msg that makes trivial clustering)
+    # (3) x and y are non-empty array values, and res is negative (will produce a ValueError,
+    # msg = please enter a non-negative res value)
+    # (4, 5) One of x and y is empty array, and res is positive


Here I woud test for len(x) != len(y) (ValueError) and not worry about the other cases. The user won't give a len(0) array on purpose and the code will error in that case anyway (on sort for example) in case the user does it inadvertently.

sbillinge

please see my inline comments. I also made some mods to the error messages, and also put the docstrings into numpy format. Please can you take a look and try and use the same pattern for all docstrings.

The main issue remaining is that test_set_data is not testing anything atm.

sbillinge · 2024-08-13T07:14:01Z

src/diffpy/srmise/tests/test_dataclusters.py

+                "y": np.array([3, 2, 1]),
+                "res": 4,
+            },
+            DataClusters(np.array([1, 2, 3]), np.array([3, 2, 1]), 4),


This doesn't appear to be testing anything as your actual and expected are both just running the same functions.

sbillinge

thanks this is good. Please see the comment

sbillinge · 2024-08-13T14:28:12Z

src/diffpy/srmise/tests/test_dataclusters.py

+                "DONE": 3,
+                "lastcluster_idx": None,
+                "status": 1,
+            }
        ),
    ],
 )
 def test_set_data(inputs, expected):
    actual = DataClusters(x=inputs["x"], y=inputs["y"], res=inputs["res"])


this is better. Strinctly this still doesn't test set_data alone. It tests the object constructor. I think this is ok, but we may want to make clear and set_data as private functions. Then we don't need tests (or docstrings in prinicple) for them and we just test the constructor (the __init__).

Whether or not to do this depends where else these functions rae used. Do we want to make them available to users to use, or are they just being used in init alone or in init and a few other places.

These are small things, but once we touch the code we want to leave it better than when we arrived, and it is also a good learning experience.....

Could you look how these two functions are used and we can decide. If we make them private functions we can leave this test as it is but just change its name.

this is better. Strinctly this still doesn't test set_data alone. It tests the object constructor. I think this is ok, but we may want to make clear and set_data as private functions. Then we don't need tests (or docstrings in prinicple) for them and we just test the constructor (the __init__).

Whether or not to do this depends where else these functions rae used. Do we want to make them available to users to use, or are they just being used in init alone or in init and a few other places.

These are small things, but once we touch the code we want to leave it better than when we arrived, and it is also a good learning experience.....

Could you look how these two functions are used and we can decide. If we make them private functions we can leave this test as it is but just change its name.

I'm certain that these two functions are only used in the constructor. It should be OK to change them into private functions.

super, let's do this. Just

change the name of this test to something like test_DataClusters_constructor since this is what it does anyway.

add an underscore to the beginning of the clear and set_data functions. You can leave the docstring since we already wrote it.

revisit the test_clear tests. We want to remove this test, but let's make sure that this behavior is being tested in the constructor test, so move over anything we need from there.

sbillinge · 2024-08-13T16:00:22Z

I made a few final changes and merged. Nice job there. Please make sure to check any docstrings on functions you touch are in the numpy format (see my examples) and also take these lessons forward about the tests, but nice job!

Please make an issue to remove the _clear() function sometime in the future (preferably when we have more tests). Since it is only used in the constructor it will never encounter a version of the object that is not empty. for now, leave it as an issue as we want to do more functional testing before we make any possibly breaking changes when there are few tests in the code.

The focus now is for you to successfully use the code in its current state.

* Lint check & fix to python3 format (#18) * lint check and change files from python2 to python3 * pre-commit check for these files * lint check & change to python3 & pre-commit check (#19) * lint check & change to python3 & pre-commit check * [pre-commit.ci] auto fixes from pre-commit hooks --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * lint check and fix print and exception python2 issue (#20) * lint check and fix python2 print and exception issues (#21) * lint check and fix python2 print and exception issues * [pre-commit.ci] auto fixes from pre-commit hooks --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * finish parenthesizing print statements (#24) * finish parenthesizing print statements * [pre-commit.ci] auto fixes from pre-commit hooks --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * fix too many leading #, import modules, and unused var (#29) * requirements (#30) * fix import module not used & string check (#25) * fix too many leading "#" in string block (#26) * lint check, remove unused import modules & remove too many "#". (#27) * remove unused modules, ambiguous variable name (#28) * cleaning (#31) * requirements * clean out __init__ * replace ### * ins not none in modelevaluators base * Copyright (#32) * requirements * basefunction * all the copyright statements * lint check, fix break import modules, remove unused import modules, remove some # (#33) * fix break import modules, remove unused import modules, fix docstring length (#34) * fix formatting issue and typo in copyright (#35) * clean out inits (#38) * clean out inits * [pre-commit.ci] auto fixes from pre-commit hooks * dataclusters.py, modelevaluators/aicc and modelparts.py --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * peakextraction.py and init (#40) * move untrack doc and requirement files (#41) * move untrack doc and requirement files * add requirement in run.txt * [pre-commit.ci] auto fixes from pre-commit hooks --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * add pyproject.toml (#42) * add pyproject.toml * [pre-commit.ci] auto fixes from pre-commit hooks * update classifiers pyproject.toml * Delete setup.py --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * move diffpy files to src dir (#44) * move diffpy files to src dir * [pre-commit.ci] auto fixes from pre-commit hooks * add Luke to authors --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * LICENSE (#45) * add two LICENSE.rst files into cookiecutter * fix LICENSE.rst and LICENSE_PDFgui.rst with correct references and year * resolve pdfdataset.py conflict --------- Co-authored-by: Simon Billinge <[email protected]> * add untrack files and add cookiecut.rst news (#46) * add untrack files and add cookiecut.rst news * delete README.txt * fix py2 -> py3, fix broken import, remove deprecation warning (#47) * fix py2 -> py3, move deprecation warning * fix search & split in binary files * fix broken import, remove deprecated pkg_resource (#50) * change import path to make it work. (#48) * fix import modules, py2->py3 (#49) * fix broken import in doc, change README to rst file. (#51) * fix broken import in doc, change README to rst file. * fix os getcwd method * fix p2 to p3 (#52) * add test for dataclusters (#54) * add test for dataclusters * define eq method in dataclusters.py * change parametrization form * add one more case and change reference name to actual * delete comment * add two more tests for DataClusters class function. * change in docstring for clearer explanation for clear method, remove duplicated case for testing behavior, remove other tests. * change clear method docstring into numpydoc format. Delete dtype for numpy array. * remove block * Make edition to condition on res, refactor for setdata to make behavior of the test passed. * change condition on res * add condition on x and res are incompatible, update test. * revert change in setdata method. * Eq tests (#59) * remove diffpy/srmise tree * test for eq * add attributes in eq method (#60) * Add set data test cases (#61) * add test cases to test files and make edition to make sure the behavior of the test pass. * [pre-commit.ci] auto fixes from pre-commit hooks * change case in test__eq__ to be compatible with the behavior of setdata * delete text and redundant tests * tweaking error message in DataClusters * [pre-commit.ci] auto fixes from pre-commit hooks * update test for checking implicit attributes for setdata function * [pre-commit.ci] auto fixes from pre-commit hooks * update test for setdata function * update setdata test to right format. * update to constructor test & make setdata clear function private * final tweaks to tests by Simon * fix actual_attribute typo * final refactor of actual_attr --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Simon Billinge <[email protected]> * fix arbitrary.py to numpydoc format (#68) * fix arbitrary.py to numpydoc format * pre-commit fix * change start sentence to 'The' * print things correctly (#71) * print things correctly * change to f string * reduce print to one line * change createpeak to actualize function (#72) * fix import and counting to make it work (#74) * refactor makeclusters to make it work (#73) * deprecation remove (#78) * deprecation remove * fix to right behavior * Revert "refactor makeclusters to make it work (#73)" (#79) This reverts commit 3773bcf. * try out py2 before py3 refactor to make sure correct workflow (#75) * fix false counting and numpy to int (#80) * fix false counting and numpy to int * [pre-commit.ci] auto fixes from pre-commit hooks --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * numpydoc edition (#81) * change peakextraction function to numpydoc * pre-commit run * remove unused import * numpydoc build (#82) * numpydoc build on peakstability (#83) * numpydoc build for ModelCluster (#85) * numpydoc build for multimodelselection.py (#87) * numpydoc documentation build for ModelCluster class (#86) * numpydoc build for pdfdataset (#88) * numpydoc build for pdfpeakextraction.py (#89) * numpydoc build for gaussianoverr.py (#91) * numpydoc build for gaussianoverr.py * [pre-commit.ci] auto fixes from pre-commit hooks * fix pre-commit --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * terminationripples.py numpydoc build (#92) * numpydoc build for gaussian.py (#90) * numpydoc build for gaussian.py * [pre-commit.ci] auto fixes from pre-commit hooks * pre-commit fix * update for FWHM and maxwidth * update for starting sentence --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * numpydoc build for base.py (#95) * numpydoc build for polynomial.py (#97) * numpydoc build for fromsequence.py (#99) * numpydoc build for nanospherical.py (#98) * numpydoc build for base.py in Baseline class (#96) * numpydoc build for base.py * [pre-commit.ci] auto fixes from pre-commit hooks --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * numpydoc build for aic.py (#93) * numpydoc build for aicc.py (#94) * numpydoc build for ModelCovariance (#84) * numpydoc build for ModelCovariance * update format type and fix indentation issue * numpydoc build for modelparts.py (#100) * numpydoc build for modelparts.py * [pre-commit.ci] auto fixes from pre-commit hooks --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * numpydoc build for basefunction.py (#101) * api workflow build for diffpy.srmise (#102) * api workflow build for diffpy.srmise * [pre-commit.ci] auto fixes from pre-commit hooks --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * add changed news (#103) --------- Co-authored-by: Rundong Hua <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

stevenhua0320 and others added 3 commits August 12, 2024 11:20

add test cases to test files and make edition to make sure the behavi…

95fc07e

…or of the test pass.

[pre-commit.ci] auto fixes from pre-commit hooks

28c6ea7

change case in test__eq__ to be compatible with the behavior of setdata

ce1d97f

sbillinge reviewed Aug 12, 2024

View reviewed changes

stevenhua0320 and others added 3 commits August 13, 2024 13:56

delete text and redundant tests

0faa2ca

tweaking error message in DataClusters

6a011a2

[pre-commit.ci] auto fixes from pre-commit hooks

83eac4c

sbillinge reviewed Aug 13, 2024

View reviewed changes

stevenhua0320 and others added 5 commits August 13, 2024 16:06

update test for checking implicit attributes for setdata function

9f17d25

[pre-commit.ci] auto fixes from pre-commit hooks

9d84a9f

update test for setdata function

321f47d

update test for setdata function

3369069

update setdata test to right format.

3ef8a5b

sbillinge reviewed Aug 13, 2024

View reviewed changes

stevenhua0320 and others added 4 commits August 13, 2024 22:55

update to constructor test & make setdata clear function private

cbe85f1

final tweaks to tests by Simon

759a7e8

fix actual_attribute typo

6f29c60

final refactor of actual_attr

d102f5f

sbillinge merged commit d1bf3d4 into diffpy:Cookie Aug 13, 2024
3 checks passed

Add set data test cases #61

Add set data test cases #61

Uh oh!

Conversation

stevenhua0320 commented Aug 12, 2024

Uh oh!

sbillinge left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sbillinge left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sbillinge left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sbillinge commented Aug 13, 2024

Uh oh!

Uh oh!

Uh oh!